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Mouse-NL, a Commensal Bacterium with a Key Role in Postnatal 
Maturation of Gut Immune Functions 
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"Candidatus Arthromitus" sp. strain SFB-mouse-NL (SFB, segmented filamentous bacteria) is a commensal bacterium necessary 
for inducing the postnatal maturation of homeostatic innate and adaptive immune responses in the mouse gut. Here, we report 
the genome sequence of this bacterium, which sets it apart from earlier sequenced mouse SFB isolates. 
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Candidatus Arthromitus" spp., also known as segmented fila- 
mentous bacteria (SFB), are noncultivable, spore-forming, 
Clostridia-related commensal bacteria that colonize the digestive 
tracts of many animal species (1). SFB typically form filaments 
that are solidly anchored in gut epithelial cells (2), notably in the 
ileum. In mice, SFB play a key role in the postnatal maturation of 
gut innate and adaptive immune functions, and notably, they can 
induce a strong IgA response (3, 4) and the recruitment and acti- 
vation of intraepithelial CD8 + Tlymphocytes (5) and lamina pro- 
pria CD4 + T cells (6, 7; reviewed in reference 8). 

Here, we report the complete genome sequence of the mouse- 
specific isolate "Candidatus Arthromitus" sp. SFB-mouse-NL. 
While an earlier sequenced mouse SFB isolate from Japan was 
shown to elicit a T helper (Th) 17 immune response in the intes- 
tinal lamina propria of C57BL/6 mice (7), the present strain, an 
independent isolate from The Netherlands (9), was reported to 
not only induce a strong Thl7 response but also to foster Thl, 
Th2, and regulatory T-cell responses in C3H/HeN mice (6). To 
define whether the observed differences may be related to genetic 
variations between the two SFB isolates, we determined the ge- 
nome sequence of strain SFB-mouse-NL. 

DNA was isolated from fecal material from SFB-mono- 
colonized mice using a modified version of the protocol described 
in Morita et al. (10). The DNA was sequenced using Illumina 
paired-end sequencing technology. The sequence reads were fil- 
tered using CLC and BOWTIE (11) to eliminate mouse genome 
sequences, and they were assembled using CLC, ABySS (12), and 
SOAPdenovo (13), yielding 40 scaffolds > 1,000 bp with high 
BLASTn scoring to existing SFB genome sequences. Finishing was 
performed using GAP4 of the Staden package (14) through itera- 
tive selection of read pair mapping to scaffold extremities and 



independent de novo assembly of these reads by SOAP. The scaf- 
folds were also ordered using Mauve aligner (15), with earlier 
published SFB genome sequences as the reference, and some se- 
quence gaps bridged by PCR performed on the basis of this align- 
ment and sequencing of the PCR products. Genome annotation 
was performed using RAST (16). 

The genome of "Candidatus Arthromitus" sp. SFB-mouse-NL 
consists of one circular chromosome (1,654,902 bp) with 1,598 
predicted coding sequences (CDS). The genome contains 4 (par- 
tial) prophages. A comparative analysis of the "Candidatus Ar- 
thromitus" sp. SFB-mouse-NL genome and two complete and 7 
incomplete other SFB genomes retrieved from NCBI (http:// 
www.ncbi.nlm.nih.gov/genomes/) using MUMmer (17) and 
Mauve (18) revealed that the SFB-mouse-NL genome is distinct 
(0.4% of overall nucleotide divergence) from the other genomes 
(which show <0.2% of overall nucleotide divergence among 
them). 

Nucleotide sequence accession number. The genome se- 
quence of" Candidatus Arthromitus" sp. SFB-mouse-NL has been 
deposited in GenBank under the accession no. CP008713. 
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